A study on the natural-sounding Japanese phonetic word synthesis by using the VCV-balanced word database that consists of the words uttered forcibly in two types of pitch accent

نویسندگان

  • Ryo Mochizuki
  • Yasuhiko Arai
  • Takashi Honda
چکیده

In order to synthesize natural-sounding Japanese phonetic words, a novel VCV-concatenation synthesis with an advanced word database is proposed. The word database consists of VCVbalanced phonetic words which are uttered forcibly in type-0 and type-1 pitch accents. The advantage of using the advanced word database is that a variety of VCV-segments with the same phonetic chains and the different pitch patterns could be collected efficiently at the same time. The following pitch modification techniques are used to achieve the sound quality: (1) The optimal VCV-segment set which minimizes the pitch modification rate is selected. (2) Pitch waveforms are extracted by referring to excitation points. (3) Wavelengths of pitch waveforms are adjusted depending on the pitch modification rates. (4) Natural prosody in the VCV-segments in the database is effectively used. Superiority of the proposed database is ensured by means of the pitch pattern matching measurement and the subjective quality evaluation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

تکیه در زبان فارسی

Abstract: This research has been carried out in the framework of Auto segmental-metrical (AM) phonology to study the stress in Persian. Two types of abstract and concrete prominences were distinguished in which the first one refers to the stress and the second one refers to the pitch accent. Stress is assumed to be a lexical property of the lexemes, but pitch accent is assumed to be an intonati...

متن کامل

Accent Sandhi Estimation of Tokyo Dialect of Japanese Using Conditional Random Fields

When synthesizing speech from Japanese text, correct assignment of accent nuclei for input text with arbitrary contents is indispensable in obtaining naturally-sounding synthetic speech. A phenomenon called accent sandhi occurs in utterances of Japanese; when a word is uttered in a sentence, its accent nucleus may change depending on the contexts of preceding/succeeding words. This paper descri...

متن کامل

Prosodic transfer in L2 relative prominence distribution: the case study of Japanese pitch accent produced by Italian learners

Relative prominence distribution, one of the major factors characterizing speech rhythm, is largely determined not only by the position of word accent/stress (word accent, henceforth) but also by the treatment of the acoustic correlates involved in word accent production (e.g., duration, F0, amplitude). Languages differ in both aspects, and those differences are expected to cause prosodic trans...

متن کامل

Word segmentation in Persian continuous speech using F0 contour

Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...

متن کامل

Pitch Accent in Japanese: Implementation by the C/D Model

In Tokyo Japanese, lexical accent is implemented by pitch pattern control, while phrasal stress patterns, along with pitch variation, convey non-lexical information in discourse. The C/D model represents pitch control by the tonal melody and stress control by the skeletal organization of the utterance. Phonetic implementation of pitch contours is exemplified here for different lexical accent pa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998